Lecture 5: Clustering and Adaptation

نویسنده

  • Mark Hasegawa - Johnson
چکیده

The state index should encode all context information that might influence the acoustics: the third state of the /t/ in “a tree” should be different from the third state of the /t/ in “a tip,” because they are acoustically different. Likewise, lexical stress, phrase position, glottalization, and dialect might matter. Unfortunately, we never have training examples sufficient to learn the likelihood function associated with every possible combination of context variables. Therefore, we use a three-step strategy: (1) learn the mean and variance of MFCCs associated with every combination of context variables available in the training database, (2) cluster the context vectors, using acoustic similarity as a metric, ensuring that we have an adequate number of training examples from each phone, (3) learn a complete mixture Gaussian PDF for each clustered context-dependent phone. There are two ways in which similarity-based clustering can be performed in HTK:

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Comparison of the learning of two notations: A pilot study

Introduction: MICAP is a new notation in which the teeth areindicated by letters (I-incisor, C-canine, P-premolar, M-molar)and numbers [1,2,3] which are written superscript and subscripton the relevant letters. FDI tooth notation is a two digit systemwhere one digit shows quadrant and the second one shows thetooth of the quadrant. This study aimed to compare the short termretention of knowledge...

متن کامل

Lecture 6 — April 16 Lecturer

Last time, we introduced the task of hierarchical clustering, in which we aim to produce nested clusterings that reflect the similarity between clusters. This contrasts sharply with our former discussion of “flat” or structureless clustering methods like k-means which do not model relationships between clusters. In this lecture, we will continue our discussion of the standard model-free approac...

متن کامل

Dynamic language model adaptation using presentation slides for lecture speech recognition

We propose a dynamic language model adaptation method that uses the temporal information from lecture slides for lecture speech recognition. The proposed method consists of two steps. First, the language model is adapted with the text information extracted from all the slides of a given lecture. Next, the text information of a given slide is extracted based on temporal information and used for ...

متن کامل

The Application of Combined Fuzzy Clustering Model and Neural Networks to Measure Valuably of Bank Customers

Currently, acquisition of resources in banks is subject to attraction of the resources of banking customers. Meanwhile, the Bank’s valuable customers are one of the best resources to make profit for banks. Several different models are introduced for evaluation of profitability of the customers; but most of them are classical models and they are unable to evaluate the customers in complete and o...

متن کامل

Lecture 16: Wireless bit-rate adaptation

This lecture, we’ll continue with the physical layer, but applied to wireless networks instead of wired networks. In particular, we’ll talk about one problem that occurs in the wireless physical layer: wireless bit-rate adaptation. We’ll start by defining bit-rate adaptation. Then we’ll talk about bit-rate adaptation in an idealized model of the wireless environment. We’ll then discuss bit-rate...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2009